Theory
- [2026.01] From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence
Data Cleaning
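- [2025.04] Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models Four-dimensional quality assessment (PRRC). The Meta-rater method trains several small proxy models to score data along multiple dimensions, then selects the data with the highest combined quality.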
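- [2024.02] Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation Scores QA pairs with a reward model (RM) and ranks them by score. Uses PCA for dimensionality reduction and k-means for clustering.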
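- [2023.12] What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning DEITA. Selects data along three axes: complexity, quality, and diversity. Uses GPT to assign complexity and quality scores to instructions and QA pairs, and uses embedding similarity (emb_sim) to measure how similar samples are.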
- [2023.08] InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models InsTag.
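
The DEITA note above combines per-sample complexity and quality scores with an embedding-similarity check. A minimal sketch of how such a selection loop could look is given below. It is not the paper's implementation: the field names (`complexity`, `quality`, `emb`), the product used to combine the two scores, the `max_sim=0.9` threshold, and the toy two-dimensional embeddings are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def select_diverse(samples, budget, max_sim=0.9):
    """Greedy score-first selection with a diversity filter.

    samples: list of dicts with hypothetical keys 'complexity', 'quality', 'emb'.
    A sample is kept only if its embedding similarity to every already
    selected sample is below max_sim (an assumed threshold).
    """
    # Rank by an assumed combined score: complexity * quality.
    ranked = sorted(
        samples, key=lambda s: s["complexity"] * s["quality"], reverse=True
    )
    selected = []
    for s in ranked:
        if len(selected) >= budget:
            break
        if all(cosine(s["emb"], t["emb"]) < max_sim for t in selected):
            selected.append(s)
    return selected


# Toy pool: scores would come from an LLM judge, embeddings from an encoder.
pool = [
    {"id": "a", "complexity": 5, "quality": 5, "emb": [1.0, 0.0]},
    {"id": "b", "complexity": 5, "quality": 4, "emb": [0.99, 0.05]},
    {"id": "c", "complexity": 3, "quality": 4, "emb": [0.0, 1.0]},
    {"id": "d", "complexity": 2, "quality": 2, "emb": [0.7, 0.7]},
]
picked = [s["id"] for s in select_diverse(pool, budget=2)]
print(picked)
```

In this toy pool, sample `b` scores well but is nearly identical to `a` in embedding space, so the filter skips it in favor of the lower-scored but distinct `c`. This is the trade-off the complexity, quality, and diversity criteria are meant to balance.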